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DESCRIPTION 



DEBLOCKING FILTER ARITHMETIC APPARATUS 
AND DEBLOCKING FILTER ARITHMETIC METHOD 

TECHNICAL FIELD 

The present invention relates to a deblocking filter 
arithmetic apparatus and a deblocking filter arithmetic 
method for removing block noises, which is one of the 
2 post processings of reconstructed pixel data employed 




: {]tl in an image signal processing or the like, and relates 



SJ to a deblocking filter arithmetic apparatus and a 



yp deblocking filter arithmetic method which carry out an 



P arithmetic in a DC offset mode that is a deblocking 
JlJ filter arithmetic which is standardized by M PEG 4 - 



picture data by employing a band compression technique, 
there are MPEG systems (Moving Picture coding Experts 
Group) discussed and standardized by ISO- 
IEC/JTCI /SC2/WG11 . In these MPEG systems, it is 
fundamental to utilize the int r a -pi c tur e correlation, 
to divide a picture into blocks each comprising plural 
pixels, perform a discrete cosine conversion as an 
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;ffl 
□ 



BACKGROUND OF INVENTION 



As a method of encoding or decoding moving 



2 



orthogonal conversion systems and a processing of 
quantization and Huffman coding to data of each block, 
thereby realizing the compression of image data. The 
pixels which are obtained by performing those 
processings can not be reproduced the original pixels 
prudently even if the inverse processing is performed, 
as a result, the pixels existing at the boundary parts 
of adjacent two blocks have different values. 
Thereby, block noises are generated. 

o 

; |p In MPEG 4 , a deblocking filter has been 

HI standardized as a count e rmea s u re against this problem. 
'SI 

SJ This filter is constituted to have two kinds of operation 
=*0 modes, in a case where the one dimensional window having 
P the block boundary at its center is provided as shown in 

Si 

Hi figure 4. In figure 4, g(n) (n is an integer from 0 to 
^ 9) denotes pixel data of ten pixels next to one other with 
the block boundary at the center. This filter performs 
the adaptive switching of those operation modes according 
to the activity of the block boundary neighboring pixels. 
The following evaluation function is employed for the 
switching of the operation mode. 
f=<*{g<0)-g<l)}+*{g(l) 9(2)) 
4- <f> { g ( 2 ) - g < 3 ) } + * { g 1 3 ) - q ( 4 ) I 
I <f> ( g ( 4 J — g ( 5 ) ) + <» { g ( 5 ) - g ( 6 ) } 
+ <f> I g ( 6 ) - g ( 7 ) } + * { g ( 7 ) - g l 8 ) J 



+ $ { g < 8 ) — g ( 9 ) I ; 

here, i f (I x I <=Thl (=2)) 

6 ( x ) = 1 ; 
else 

© ( x ) = 0 ; 

Th denotes a threshold value. The operation mode is 
switched as follows employing the evaluation function, 
i f i f >-Th2 ( = 6 )) 
DC offset mode ; 

■3 else 

.m 

- 3! J 

Ul default mode ; 

SI 

M The DC offset mode shown here is an operation mode in case 

SI 

"tS where the changing of Die pixel data existing at the block 
P boundary is calm, and the default mode is an operation 
^ mode in case where the changing of the pixel data existing 
0 at the block border is drastic. With respect to the DC 
offset mode, the filter shown in the following is defined. 
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coef ( 9 ) — 1 ; 

Then, the following filtering processing is performed to 
obtain the processed pixel g ' (m ) (m= 1 , 2 , 3 , 4 , 5 , 6 , 7 , 8 ) 
MAX = max(g(l), g(2), g(3). g ( 4 ) . g ( 5 ) . g I 6 ) , g(7 ), 
g ( 8 ) ) ; 

MIN=mi n <g ( 1 ) , g(2), g { 3 ) . g(4). g(5), g(6) f g < 7 ) > 
g ( 8 ) ) ; 

i f (IMAX-MIN | < 2 * Q P ) 
{ 

□ 

^win padding = | g < 0 ) - g { 1 ) I <QP ?g(0) : g ( 1 ) ; 
:! max_padding= I g(8)-g(9) | <QP?g(9) : g ( 8 > : 
^q'(m]=0 ; 

w forri= s -4;i<5;i+ + ) 

H g' (m) + =coef ( i + 4 ) * 

' w 

pi ( m + i < 1 ? min_padding : 

Zl (m+ i > 8 ?max padding : g(m + i ) ) ) ; 

: i — — 

g ' < m > = n i n t < g • { m ) / 1 6) ; 

} 

else 

g ' ( m ) = g { m ) ; 

Here, QP denotes a quantization parameter of a macro block 
to which a pixel value of g(5) belongs. Further, 
min_padding and max_padding are, as defined in the 
above-described equations, values which are obtained 
respectively from the first pixel data g(l) and the eighth 



pixel data g(8), and the pixel data g < 0 ) and the pixel 
data g(9) # which are adjacent at outside the pixel data 
g(l) and the pixel data g(6), respectively. This 
filtering is performed to all horizontal edges, and 
thereafter, performed to all vertical edges. In this 
filtering processing, when 
for(i=-4; i<5; i+ + ) 

g ' ( m ) H coef ( i +4)* 

(m + 1 < 1 ? mi n_padding : 

□ 

iD ( m + i > 8 ? ma x_p adding : a ( m t 1 ) ) ) : 

j^j g ' ( m ) = n i n t (g'lml/l 6) ; 

[ SJ - are carried out by software in a general purpose 
arithmetic device included in the processor, the flow as 
H shown in figure 3 is obtained. 

'jp Hereinafter, the flow shown in figure 3 will be 

^ described. At first, by a start instruction, the value 
m out of eight pixels g (m) , to which the arithmetic 
processing is to be carried out, is set (step SI) . Next, 
the value i is set (step S2). As the initial value for 
i, i=-4 is set. Subsequently, whether i is 5 or not is 
detected (step S3) , and when i is not 5, m and i are added 
(step S4) . Then, when m+i is smaller than 1 in step S3, 
min_padding data is written into the memory and 1 is added 
to i (step S6) . Further, when m + i is larger than or equal 
to 1, m+i and 8 are compared in step S7, and when m + i is 




larger than 8, ma.x_padding data is written into the memory 
and 1 is added to i (step S8) . When m+i is smaller than 
or equal to 8, g (m+i ) data is written into memory and 1 
is added to i (step S9) . For each value of m, these 
operations are repeated nine times in total, i.e., while 
the value i changes from -4 to 4, and when the value of 
i is 5, multiplication and addition are performed to nine 
data which have been written into the memory until then 
(step S10), and the result is shifted by 4 bits in step 

□ 

^Sll, thereby a result obtained by performing the 

■B 

filtering processing, i.e., the pixel data obtained by 
^performing the filtering processing are output. 
^ However, in the filtering processing performed by the 
conventional general purpose arithmetic device, a result 
2 is output by repeating by nine times in total for i from 
^ -4 to 4, for each value of m. in order to output this 
one result, 67 cycles are required at the maximum. In 
addition, there are 8 values from 1 to 8 for the value 
of m, and therefore, 536 cycles that is obtained by 
mu Itiplying 67 cycles by 8 are required, thereby 
resulting in an increase in the arithmetic cycle number, 
which further results in the processing. 

The present invention is made to solve the above- 
described problems, and has for its object to provide a 
deblocking filter arithmetic apparatus having a less 



arithmetic cycle number. 

DISCLOSURE OF INVENTION 

A deblocking filter arithmetic apparatus 
according to the present invention includes a first to 
eighth arithmetic blocks which receive, as inputs, 
simultaneously every two adjacent data among a first to 
eighth pixel data, carry out one of the cycles of the 
processing arithmetic constituting the filtering 
processing corresponding to the first, to eighth pixel 
data and performed for removal of block noises, every time 
when two of the pixel data are input, and output the 
respective pixel data having been subjected to the 
filtering processing, being provided in parallel 
corresponding to the first to the eighth pixel data and 
to which two of the pixel data are input simultaneously, 
an output selection circuit which selects one from the 
outputs from the first to the eighth arithmetic blocks 
to output the same, and a control circuit which controls 
the processing arithmetic of each arithmetic block, in 
accordance with the cycles of the processing arithmetic, 
so that the cycles of the processing arithmetic performed 
in each combination, among a combination of the first and 
the second arithmetic block, a combination of the third 
and the fourth arithmetic block, a combination of the 



fifth and the sixth arithmetic block, and a combination 
of the seventh and the eighth arithmetic block, 
respectively constituted by above -described arithmetic 
blocks up to the conclusion ot the respective filtering 
processing should be the same and, further that the 
filtering processing of each combination should be 
concluded in an order successively among the respective 
combinations, and which controls the output selection 
circuit so as to select the output from the arithmetic 
■\y blocks in a unit of the combination of the arithmetic 

m 

■^blocks and to perform pipeline output . Therefore, the 
N processing cycles of arithmetic by each arithmetic block 
^ can be reduced and the arithmetic results of each 
^- combination of the arithmetic blocks can be output by a 



■■m 
Q 



pipeline output, whereby it is possible to result in 
reductions in cycles of the arithmetic processing. 

According to the present invention, there is 
provided a deblocking filter arithmetic apparatus 
wherein, the arithmetic block comprises a first selection 
circuit which selects, according to the cycle of the 
processing arithmetic, any one among one of two pixel data 
input simultaneously, two values obtained respectively 
from the first pixel data and the eighth pixel data, and 
pixel data which are adjacent at outside the first pixel 
data and the eighth pixel data, and value 0, a second 



selection circuit which selects, according to the cycle 
of the processing arithmetic, any one among the other of 
the two pixel data input simultaneously, two values 
obtained respectively from the first pixel data and the 
eighth pixel data, and pixel data which are adjacent at 
outside the first pixel ddLa and the eighth pixel data, 
and value 0, a third selection circuit which selects an 
arithmetic result for the accumulation or value 3, a first 
shifter which receives, as input, the output of the first 
^ selection circuit, a second shifter which receives, as 
y= input/ the output of the second selection circuit, an 
adder which adds together the output of the first shifter, 
the output of the second shifter and the output of the 

q third selection circuit, a register which receives, as 

.Hi 

^ input, the output of the adder and outputs the same as 
Lj, an arithmetic result for the accumulation to the third 
selection circuit, and a third shifter which receives, 
as input, the output of the register and outputs the same 
as an arithmetic result to the output selection circuit. 
Thereby, it is possible to result in reductions in cycles 
of the arithmetic processing- 
According to the present invention, there is 
provided a deblocking filter arithmetic method 
comprising: a step for receiving, as inputs, 
simultaneously every two data among a first to an eighth 



successive pixel data and carrying out in parallel the 
processing arithmetic constituting the filtering 
processing corresponding to the first, to the eighth pixel 
data and performed for removal of block noises, so that 
the cycles of the processing arithmetic performed in each 
combination, among a combination of the first and the 
second arithmetic block, a combination of the third and 
the fourth arithmetic block, a combination of the fifth 
and the sixth arithmetic block, and a combination of the 
iQ seventh and the eighth arithmetic block, respectively 
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0] constituted by above-described arithmetic blocks up to 
SI the conclusion of the respective filtering processing 
^ should be the same and, further that the filtering 
^ processing of respective combinations should be 
jj* concluded in an order successively, and a step for 
P performing a pipeline output of pixel data which are 
obtained by the above -de s cribe d step with having been 
subjected to the filtering processing, for the respective 
combinations of the pixel data in an order successively. 
Therefore, the processing cycles of arithmetic by each 
arithmetic block can be reduced and the arithmetic 
results of each combination of the arithmetic blocks can 
be output by a pipeline output, whereby it is possible 
to result in reductions in cycles of the arithmetic 
pr oces sing. 



BRIEF DESCRIPTION OF DRAWINGS 

Figure 1 is a block diagram of a deblocking filter 
arithmetic apparatus according to the present invention, 
figure 2 is a diagram for explaining an operation of the 
arithmetic blocks corresponding to m= 1 - 4 in the 
deblocking filter arithmetic apparatus according to the 
present invention, figure 3 is a flowchart illustrating 
an operation of a conventional deblocking filter 

^arithmetic apparatus, figure 4 is a diagram illustrating 

-13 

^Lhan arrangement of pixel data with a block boundary at the 

"N 

^center, for explaining the operation of the conventional 

"S5 

^deblock filter arithmetic apparatus, figure 5 is a 

H 

^diagram for explaining an operation of arithmetic blocks 
^corresponding to m=5-8 in the deblocking filter 
H arithmetic apparatus according to the present invention, 
and figure 6 is a diagram for explaining an output 
operation of the deblocking filter arithmetic apparatus 
according to the present invention. 



BEST MODE TO EXECUTE THE INVENTION 

Figure 1 is a block diagram illustrating the 
construction of a deblocking filter arithmetic apparatus 
according to an embodiment of the present invention. The 
deblocking filter arithmetic apparatus according to this 



embodiment is one that performs the following arithmetic, 
which is a part of the deblocking filter arithmetic: 
f o r li = -4 ; i <5 ; i + +) 
g 1 ( m ) + = coef { i f 4 ) * 

(m + 1 < 1 ?min padding: 
( m + i > 8 ? max_paddlng : g ( m I 1 ) ) ) ; 
g , (m) = n int(g'(m)/16); ■ • • equation! 1 ) 

In this figure, a control circuit 1 performs counting of 
^ the processing cycles from 0 to 7 and outputs the count 
lvalue to the first to eighth arithmetic blocks 1 01-108 
Q corresponding to a first to an eighth pixels m (m is an 

■y integer from 1 to 8) as targets of the filtering 

d 

g processing, and to an output selectiny circuit 7 which 

□ 

SJ selects one from the outputs from the first to eighth 
gi arithmetic blocks 101-108. The arithmetic block 101 

car 

U comprise a first and second selection circuit 111 and 
121, which receive as inputs at least two out of 
max_padding 3 and min_padding 4, which are produced 
outside beforehand and input, g{x) 5 and g(x+l) 6, which 
are actual 8 bits data, and data "0", a third selection 
circuit 131 which receives, as inputs, data w 8" and the 
output of a register 171, which register is described 
later, a first and second shifter 141 and 151 which shift 
the selection results by the first and the second 
selection circuit 111 and 121 respectively, an adder 161 



which adds the outputs of the first and the second shifter 
141 and 151 to the output from the selection circuit 131, 
a register 171 to which the output from the adder 161 is 
input, and a third shifter 181 which receives, as inputs, 
the output of the register 171 and shifts it to output 
the result to the output selection circuit 7. The g(x) 
5 and the g(x+l) 6 are input through the 16 bits bus. The 
max_padding 3 and the min_padding 4 are, as shown in the 
equation at the background of art, values obtained 

^respectively from the first pixel data g(l) and the eighth 
pixel data g(8) among the first to eighth pixel data g(m) 
as targets of the filtering processing and the pixel data 

_ g(0) and the pixel data g(9) which are adjacent at outside 
■ 5g the first pixel data g(l) and the eighth pixel data g(8) 

Hi 

q\ respectively. The arithmetic blocks 102-108 have the 
same construction as that of the arithmetic block 101, 
and they comprise respectively the first selection 
circuits 112-118, the second selection circuits 122-128, 
the third selection circuits 132-138, the first shifters 
142-148, Ihe second shifters 152-158, the adders 162-163, 
the registers 172-178 and the third shifters 182-188. 
The selection circuit 7 selects one from the outputs from 
the first to the eighth arithmetic blocks 101-108 to 
output one as output 8 . 

Figure 2, figure 5 and figure 6 are diagrams for 




explaining the operation of the deblocking filter 
arithmetic apparatus according to the embodiment of the 
present invention. More particularly, figure 2 shows 
the relationship between the arithmetic processing 
results of the arithmetic blocks 101-104 and the count 
value of the control circuit 1, figure 5 shows the 
relationship between the arithmetic processing results 
of the arithmetic blocks 105-103 and the count value of 
the control circuit 1/ and figure 6 shows the relationship 
m between the outputs to the output selection circuit 7 from 

U1 the respective arithmetic blocks 101-108 and the count 

"S3 

value. In these figures, ra=l-8 correspond respectively 

Sa 

*S to the arithmetic blocks 101-108 and M-U present Lhe 

^ arithmetic results of the arithmetic blocks 101-108. 

Sj 

Further, black circles represent that g(x) 5, g(x + l) 6, 

'91 

■fj; max_padding 3 and min_padding 4 , which are input data 
respectively, are input. AU presents g(x) 5, ALpresents 
g{x+l) 6, CRL presents min_padding 4, and CRU presents 
max_padding 3, respectively. A reference "<<" presents 
a bit shift. 

A description is given of an operation of the 
deblocking filter arithmetic apparatus thus constituted 
with reference to figure 1, figure 2, figure 5 and figure 
6 . 

The control circuit 1 shown in figure 1 performs 



counting up from 0 to 7 by a start signal 2 and repeats 
counting of 2 to 7 thereafter, and as shown in figure 2 
and figure 5, it makes the pixel signals g(l) and g{2) 
input when the value of the counter value CNT is 0/ the 
pixel signals g(3) and g(4) input when the value of CNT 
is 1, the pixel signals g(5) and 9 (6) input when the value 
of CNT is 2, the pixel signals g(7) and g(8) input when 
the value of CNT is 3, the pixel data g(7) and g(8) input 
when the value of CNT is 4, the pixel data g(7) and g(8) 
a! input when the value of CNT is 5, the pixel signals g(9) 

i 

ii! and g(10) input when the value of CNT is 6, and the pixel 



\ [ signals g (11) and g(12) input when the value of CNT is 7, 

respectively successively, 
'pj In the arithmetic block 101, as shown in figure 2, 

■H* when the value of CNT is 0, g(2) and g(l) arc selected 
P by the first and the second selection circuit 111 and 121 
respectively, M 8 " is selected by the third selection 
circuit 131, and the input data is shifted by 2 bits by 
the second shifter 151 and by 1 bit by the first shifter 
141, the output of the first shifter 141, the output of 
the second shifter 151 and the output of the third 
selection circuit 131 are added together by the adder 161 
to be stored in the register 171. The data M which is 
obtained as an arithmetic result of this processing cycle 
is M=AU<<2+AL<<l+8 , as shown in figure 2. 



Next, when the value of C NT results in 1, g(4) and 
g(3) are selected by the first and the second selection 
circuit 111 and 121 respectively, and M, which is the 
arithmetic result when the value of CNT is 0, and is input 
from the register 171 to the third selection circuit 131, 
is selected by the third selection circuit 131, and the 
input data is shifted by 1 bit by the second shifter 151 
and not shifted by the first shifter 141, the output of 
the first shifter 141, the output of the second shifter 
\0151 and the output of the third s el ect i ori ci r cu i t 131 are 

m 

Uladded together by the adder 161 to be stored in the 

Si 

^'register 171. The data M which is obtained as an 
^arithmetic result of this processing cycle is 

E 

^M=AU<< 1 +AL + M (M represents the arithmetic result when the 

%! 

|value of CNT is 0), as shown in figure 2. 

Subsequently, when the value of CNT results in 2, 
min_padding and g(5) are selected by the first and second 
selection circuit 111 and 121 respectively, and M, which 
is the arithmetic result when the value of CNT is 1 and 
is input from the register 171 to the third selection 
circuit 131, is selected by the third selection circuit 
131, and the input data is not shifted by the second 
shifter 151 and is shifted by 1 bit by the first shifter 
141, the output of the first shifter 141, the output of 
the second shifter 151 and the output of the third 



selection circuit 131 are added together by the adder 161 
to be stored in the register 171. The data M which is 
obtained as an arithmetic result of this processing cycle 
is M=AU + CRL<<1 + M (M represents the arithmetic result when 
the value of CNT is 1), as shown in figure 2. 

Subsequently, when the value of CNT results in 3, 
n\in_padding and min_padding are selected by the first 
and the second selection circuit 111 and 121, and M, which 
is the arithmetic result when the value of CNT is 2 and 

.=3. 

UB is input from the register 171 to the third selection 

L#l circuit 131, is selected by the third selection circuit 
SJ 

^ 131, and the input data is shifted by 1 bit by the second 

M 

shifter 151 and is shifted by 1 bit by the first shifter 

.a 

^ 141, the output of the first shifter 141, the output of 

Of the second shifter 151 and the output of the third 

'01 

pi 

™ selection circuit 131 are added together by the adder 161 
to be stored in the register 171. The data M which is 
obtained as the arithmetic results of this processing 
cycle is M=CRL<< 1 +CRL< < 1 +M (M represents the arithmetic 
result when the value of CNT is 2), as shown in figure 
2 . 

In this way, by repeating the arithmetic as shown 
in figure 2 until the value of CNT results in 3, 
8+min_padding+min_padding+min_padding<< 1 +min_padding 
<<l+g(l)<<2 + g(2)<<l + g(3)<<l + g(4)+g(5), which is 



the final result of the arithmetic of 4 cycles in total 
of the ariLhiuetic block 101, is shifted by 4 bits in the 
right direction by the third shifter 181 and is output. 

The arithmetic blocK 101, thereafter, repeats the 
same processing as that described above from the timing 
when the counter has counted two processing cycles. 

Hereinafter, the operation as shown in figure 2 and 
figure 5 is carried out on the basis of the count value 
by the arithmetic blocks 102-108 which have the same 

□ 

& construction as that of the arithmetic block 101 also for 

■m 

: Ln m=2-8 . 

N The operation of the respective arithmetic blocks 

si 

^ 101-108 will be described in detail- 

■■P=e 

H Initially, in the arithmetic block 101 

sj 

l ~i corresponding to m-1, the value of the above -described 
^ equation (1) when the value of i is changed from -4 to 
4 results in 1 xmin_padding when i =-4, 1 x min_padding when 
i=-3, 2 xmin_padding when i=-2 , 2 *min_pa ddi ng when i =-1, 
4xg(i) when i =0, 2xg(2) wheni = l, 2 * g ( 3 ) when i = 2 , l*g(4) 
when i = 3, l*g(5) when 1=4. 

Among them, the cases where i = -4 and i = -3 are 
collected to be represented by 2 *min_padding , and is 
added to g<5) when i = 4 . This corresponds to AU + CRL<<1, 
which is the result when CNT = 2 shown in figure 2. The 
cases where is = -2 and is = -l are collected to be 



represented by 2 * mi n_p add i n g + 2 * min_p a dd i n g , i.e., by 
CNT = 3 and C RL< < 1 +C RL< < 1 shown in figure 2, and the cases 
where i=0 and i=l are collected to be represented by 
4 * g ( 1 ) +2 * g ( 2 ) , i.e., can be represented by CNT = 0 and 
AU<<2+AL<<1 shown in figure 2, and the cases where i=2 
and i = 3 are collected to be represented by 2*g(3)+l*g(4), 
i.e., by CNT=1 and AU<<1+AL shown in figure 2. Here, M 
in figure 2 represents the output of the register 171 and, 
further, +8 when CNT = 0 is a value for rounding and when 
there is no rounding, 0 may be input. In this way, 
selectors 111, 121 and 131 and shifters 141 and 151 are 
controlled according to the value of CNT, the above- 
described arithmetic is carried out, and the shifter 181 
is shifted when the value of CNT is 4, which is the last, 
thereby a result obtained by performing the filtering 
processing to the pixel of m-1 as an arithmetic result. 
In this way, as a result that the arithmetic as shown in 
figure 2 is performed at the arithmetic block 101, a 
result which is the same as that obtained by the original 
arithmetic can be obtained with less cycles than in a case 
where 9 cycles arithmetic are performed for m=l is 
performed with changing i from -4 to 4. 

Further, the values of the equation (1) when the 
value of i is changed from -4 to 4 in the arithmetic block 
102 corresponding to m=2 results in as follows. 
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■•in 



□ g (2 ) + = coef ( 4 ) Xg(2) 

S! 

HJ — 4 X AL 

SI 

D i = l 

g (2) + = coef (5) Xg(3) 
= 2 X AU 

i = 2 

g (2 ) -f = coef (6) X g ( 4 ) 
= 2 X AL 

i = 3 

g(2j +=coef (7) Xg(5) 
= 1 X AU 

i = 4 



i =-4 

g ( 2 ) t = coe f ( 0 ) Xmin_padding 
= 1 x crl 

i = - 3 

g ( 2 ) + = coe f ( 1 ) X min_paddi ng 
= 1 X CRL 

i = -2 

g (2) + = coef (2) Xmin_padding 
= 2 X CRL 

i = -l 

g(2) + = coef (3) Xg(i) 
= 2 X AU 



SI 

■*5 i = 0 



g ( 2 ) i -coe f ( 8 ) Xg(G) 
= 1X AL 

Among them, when i = -l and i =0 are collected, it results 
in 2 *g { 1 ) + 4 xg ( 2 ) and can be represented by AU<<1+AL<<2 
in the cycle of CNT = 0 in figure 2, when i = l and i = 2 are 
collected, it results in 2 * g ( 3 ) +2 * g ( 4 ) and can be 
represented by AU << 1 + AL<< 1 in the cycle of CNT=1 in 
figure 2, when i~3 and i=4 are collected, it results in 
l*g (5) + l*g (6) and can be represented by AU + AL in the cycle 
lOof cnr-2 in fiyure 2, and when i=-2 and i = - 3 and i = - 4 are 

m 

1J1 col 1 ec t ed, it results in 2 *min_padding+2 xmin_padding and 



>3 



□ 

"Si 

m 

rj 



^-i can be represented by CRL< < 1 +CRL< < 1 in the cycle of CNT = 3 
^ in figure 2. 

Here N in figure 2 represents the output at the 
previous cycle of the register 172 and, further, +8 when 
CNT = 0 is a value for rounding and when there is no rounding, 

0 ma y be input . 
In this way, the selectors 112, 122 and 132 and the 

shifters 142 and 152 are controlled according to the value 
of the CNT , and the arithmetic as shown in figure 2 is 
carried out in the arithmetic block 102, thereby a result 
which is the same as that obtained by the original 
arithmetic can be obtained with less cycles than in a case 
where 9 cycles arithmetic to are performed with changing 

1 from -4 to 4 successively for m=2 . in addition, it is 



possible to make the cycle number up t.o the conclusion 
of the arithmetic be the same as that of the arithmetic 
processing corresponding to the pixel data of m-1 by the 
arithmetic block 101 and, further make the cycle at which 
the arithmetic processinq is concluded be the same as that 
of the arithmetic block 101. 

Further, the value of the equation (1) when the value 
of i ic changed from 4 to 4 in the arithmetic block 103 
corresponding to m=3 results in as follows. 
iS i = -4 

in g (3) + = coef (0) Xmin_p adding 



M =1XCRL 
£3 i = -3 



^ g ( 3 ) +=coe f ( 1 ) X min_padding 

= 1 X CRL 



nj 

□ i = -2 



g (3) + = coef (2 ) Xg(l] 
-2 X AU 

i = -l 

g (3) + = coef ( 3 ) Xg(2) 
= 2 X AL 

i = 0 

g(3)+=coef (4) Xg(3) 
= 4 X AU 

i=l 



g ( 3 i i -coef ( 5 ) Xg(4] 
= 2 X AL 

i-2 

g (3) + = coef (6) Xg{5) 
= 2 X AO 

i = 3 

g(3) + = coef (7) Xg{6) 

-1 X AL 

i = 4 

Jj g (3) + = coef (8) X g (7) 

.'jn =ixau 

SJ 

^1 Among them, when i = -2 and i = -l are collected, it results 
s1 q in 2*g(l)+2*g(2) and can be represented by AU<<1+AL<<1 

p in the cycle of CNT=*0 in figure 2, when i = 0 and i = l are 

SJ 

collected, it results in 4 * g ( 3 ) i 2 * g ( 4 ) and can be 

n\ 

p represented by AU<<2+AL<<1 in the cycle of CNT=1 in 
figure 2, when i = 2 and i=3 are collected, it results in 
2 xg ( 5 ) + 1 * g ( 6 ) and can be represented by AU<<2+AL in the 
cycle of CNT=2 in figure 2, and when i=4 and i=-4 are 
collected, it results in 1 x g ( 7 ) +1 *min_padding and can 
be represented by AU + CRL in the cycle of CNT = 3 in figure 
2, and i=-3 is 1 x mi n_p a ddin g and it can be represented 
by CRL in the cycle of CNT=4 in figure 2. Here, P in 
figure 2 represents the output at the previous cycle of 
the register 173 and, further, +8 when CNT = 0 is a value 



for rounding and when there is no rounding, 0 may be input. 

In this way, selectors 113, 123 and 133 and shifters 
143 and 153 are controlled according to the value of the 
CNT, and the arithmetic as shown in figure 2 is carried 
out in the arithmetic block 103, thereby it is possible 
to obtain a result which is the same as that obtained by 
the original arithmetic with less cycles than in a case 
where 9 cycles arithmetic to are performed with changing 
i from -4 to 4 successively for m=3, 
ifl Further, the value of the equation (1) when the value 

m 

U]of i is changed from -4 to 4 in the arithmetic block 104 

Si 

M corresponding to m=4, becomes as follows. 

vi 

IB 

£3 i = -4 

g(3)+ = coef (0) Xmin_padding 

HI 



•==? 



^4 
HI 

m 



=1 XCRL 
i = -3 

g ( 3 } + = coef ( 1 ) X g ( 1 ) 
= 1 X AU 

i = -2 

g (3) +=coef (2) Xg{2) 
= 2 X AL 

i--l 

g(3) +-coef (3) Xg(3) 
= 2 X AU 

i = 0 



g ( 3 ) i -coef ( 4 ) Xg(4) 
= 4 X AL 

i = l 

g (3) +=coef (5) Xg(5) 
= 2 X AU 

i = 2 

g (3) + = coef (6) X g ( 6) 
-2XAL 

i = 3 

.-SO 9 < 3 > + = coef ( 7 ) X g ( 7 ) 

5l =-lXAU 

SI 

Sf i = 4 

SJ 

g ( 3) +=coef ( 8 ) X g ( 8 ) 

D =1 X AL 

Among them, when i = -3 and i = -2 are collected, it results 

□ in 1 * g { 1 ) +2 * g ( 2 ) and can be represented by AU + AL<<1 in 
the cycle of CNT = 0 shown in figure 2, when i = -l and i = 0 
are collected, it results in 2 x g ( 3 ) +4 * g ( 4 ) and can be 
represented by AU<<1+AL<<2 in the cycle of CNT=1 shown 
in figure 2, when i = l and i = 2 are collected, it results 
in 2 x g ( 5 ) +2 * g ( 6 ) and can be represented by AU<<1+AL<<1 
in the cycle of CNT = 2 shown in figure 2, and when i = 3 and 
i = 4 are collected, it results in 1 * g ( 7 ) + 1 * g ( 8 ) and can 
be represented by AU+AL in the cycle of CNT=3 shown in 
figure 2, and i = -4 is 1 * mi n_p a dd in g and it can be 



represented by CRL in the cycle of CNT=4 shown in figure 
2. Here, Q in figure 2 represents the output at the 
previous cycle from the register 174 and, further, +8 when 
CNT = 0 is a value for rounding, and when there is no 
rounding, 0 may be input. 

In this way, the selectors 114, 124 and 134 and the 
shifters 144 and 154 are controlled according to the value 
of the CNT, and the arithmetic as shown in figure 2 is 
carried out in the arithmetic block 104, thereby it is 
£i possible to obtain a result which is the same as that 
obtained by the original arithmetic, with less cycles 

SJ than in a case where 9 cycles arithmetic are performed 

SI 

with changing i from -4 to 4 successively tor m=4 . In 

S addition, it is possible to make the cycle number up to 

s \ 

Hi concluding the arithmetic be the same as that of the 
■p arithmetic processing corresponding to the pixel data of 

i 

m= 3 performed by the arithmetic block 103 and, further 
it is possible to make the cycle at which the arithmetic 
processing is concluded be the same as that by the 
arithmetic block 103. 

Further, the values of the equation (1) when the 
value of i is changed from -4 to 4 in the arithmetic block 
105 corresponding to m=5 become as follows. 
i = -4 

g(3) + = coef (0) Xg(l) 



SI 
SI 



■as 



= 1 X AU 

i = -3 

g (3) + = coef (1) Xg(2) 
= 1 X AL 

i = -2 

g(3)+-coef (2) Xg(3] 
= 2 X AU 

g(3) +=coef (3) Xg<4) 



■fl =2XAL 



1 = 0 



g { 3) + = coef (4) X g (5) 
= 4 X AU 

i = l 



^ g (3) + = coef (5) Xg(6) 



= 2 X AL 

i = 2 

g ( 3) + = coef ( 6) X g (7) 
= 2 X AU 

i = 3 

g(3)+ = coef (7) Xg(8) 
- 1 X AL 

i = 4 

g (3) +-coef (8) Xmax_padding 
= 1 X CRU 



Among them, when i--4 and i~-3 arc collected, it results 
in l*g ( i ) +1 *g (2 ) and can be represented by AU + AL in the 
cycle of CNT = 0 shown in figure 5, when i^-2 and i = -l are 
collected, it results in 2 * g ( 3 ) \ 2 * g { 4 ) and can be 
represented by AU<<1+AL<<1 in the cycle of CNT=1 shown 
in figure 5, when i = 0 and i = l are collected, it results 
in 4xg (5) +2*g (6) and can be represented by AU<<2+AL<<1 
in the cycle of CNT~2 shown in figure 5, and when i = 2 and 
i = 3 are collected, it results in 2 * g ( 7 ) + 1 x g ( 8 ) and can 
be represented by AU<<1+AL in the cycle of CNT=3 shown 

^ in figure 5, and i = 4 is 1 xmax_padding and it can be 

SJ 

Si represented by CRU in the cycle of CNT = 4 shown in figure 

SJ 

ifi 5 • Here, P in figure 5 represents the output at the 

s 

0 previous cycle from the register 175 and, further, +8 when 
■Rj CNT = 0 is a value for rounding, and when there is the 
□rounding, 0 may be input. 

In this way, the selectors 11, 125 and 135 and the 
shifters 145 and 155 are controlled according to the value 
of the CNT, and the arithmetic as shown in figure 5 is 
carried out in the arithmetic block 105, thereby it is 
possible to obtain a result which is the same as that 
obtained by the original arithmetic with less cycles than 
in a case where 9 cycles arithmetic are performed with 
changing the value i from -4 to 4 successively for m-5. 
Further, the value of the equation (1) when the value 



of i is changed from -4 to 4 in the arithmetic block 106 
corresponding to m=6 becomes as follows. 
i = -4 

g (3) + = coef (0) Xg(2] 
= 1 X AL 

i = -3 

g(3) + = coef (1) Xg (3) 
= 1 X AU 

i = -2 

n 

£ g(3)+-coef(2) Xg(4) 

DO 

^ — 2 X AL 

i 



S 



g (3 ) +=coef (3) X g ( 5 ) 



y =2XAU 



□ 



g (3) + = coef (4) Xg(6) 



= 4 X AL 



i = l 

g ( 3 ) + = coef (5) X g ( 7 ) 
= 2 X AU 

i = 2 

g ( 3 ) + = coef (6) X g ( 8 ) 
= 2 X AL 

i = 3 

g ( 3 ) + = coe f ( 1 ) Xmax_padding 



= 1 X CRU 

i = 4 

g (3) + = coef (8) X max_pa ddi ng 
= 1 X CRU 

Among them, i = -4 is l*g(2) and it can be represented by 
AL in the cycle of CNT = 1 shown in figure 6, when i = -3 and 
i = -2 are collected, it results in 1 * g ( 3 ) +2 * g ( 4 ) and can 
be represented by AU+AL<<1 in the cycle of CNT-1 shown 
in figure 5, when i = -l and i = 0 are collected, it results 
i£jin 2 x g ( 5 ) +4 x g { 6 ) and can be represented by AU<<1 + AL<<2 
inin the cycle of CNT = 2 shown in figure 5, when i = l and i = 2 
SJ are collected, it results in 2xg{5) +2xg (6) and can be 
represented by AU<<1+AL«1 in the cycle of CNT = 3 shown 
in figure 5, and when i = 3 and i = 4 are collected, it results 

SJ 

I in 1 *max_padding+ 1 *mx_padding and can be represented by 
B CRU+CRU in the cycle of CNT = 4 shown in figure 5. Here, 
S in figure 5 represents the output at the previous cycle 
from the register 176 and, further, +8 when CNT=0 is a 
value for rounding, and when there is no rounding, 0 may 
be input . 

In this way, the selectors 161, 162 and 163 and the 
shifters 164 and 165 are controlled according to the value 
of the CNT, and the arithmetic as shown in figure 5 is 
carried out in the arithmetic block 106, thereby it is 
possible to obtain a result which is the same as that 



obtained by the original arithmetic with less cycles than 
in a case where 9 cycles arithmetic are performed with 
changing the value i from -4 to 4 successively for m= 6 . 
In addition, it is possible to make the cycle number up 
to concluding the arithmetic be the same as the cycle 
number of the arithmetic processing corresponding to the 
pixel data of m=5 performed by the arithmetic block 105 
and, further it is possible to make the cycle at which 
the arithmetic processing is concluded be the same as that 
Hp of the arithmetic block 105. 

11} Further, the values of the equation (1) when the 

"Si 

! M value i is changed from -4 to 4 in the arithmetic block 
£ : 107 corresponding to m^7 becomes as follows. 

O i = -4 

SI 

J! g (3) + = coef (0) Xg (3) 

y * 

F? =1XAU 

■F* 

i = -3 

g (3) + = coef (1) Xg (4 ) 
= 1 X AL 

i = -2 

g (3) + = coef (2) X g (5) 
-2 x AU 

i = -l 

g { 3 ) + = coef (3 ) X g ( 6) 
= 2 X AL 



g (3) +=coef (4) Xg (7) 
= 4 X AU 

i = l 

g(3)+ = coef (5) Xg(8) 
= 2 XAL 

i = 2 

g(3)+=coef (6) Xmax_padding 
= 2 X CRU 

i = 3 

g(3)+*coef (7) Xmax_p adding 
= 1XCRU 

i = 4 

g ( 3 ) + = C oef (8 ) X max_padding 
^lXCRU 

Among them, when i = -4 and i = -3 are collected, it results 
in lxg (3) +l»g (4) and can be represented by AU + AL in the 
cycle of CNT = 1 shown in figure 5, when i = -2 and i = -l are 
collected, it results in 2 * g ( 5 ) +2 x g ( 6 ) and can be 
represented by AU<<1+AL<<1 in the cycle of CNT=2 shown 
in figure 5, when i = 0 and i = l are collected, it results 
in 4 * g ( 7 ) +2 * g { 6 ) and can be represented by AU<<2+AL<<1 
in the cycle of CNT = 3 shown in figure 5, and when i = 2 and 
i=3 are collected, it results in 

2*max padding+l*max_padding and can be represented by 



CRU<<1+CRU in the cycle of CNT = 4 shown in figure 5, and 
i = 4 is l*max_padding and it can be represented by CRU in 
the cycle of CNT = 5 shown in figure 5. Here, P in figure 
5 represents the output of the previous cycle from the 
register 177 and, further, +8 when CNT = 0 is a value for 
rounding, and when there is no rounding, 0 may be input. 

in this way, the selectors 117, 127 and 137 and the 
shifters 147 and 157 are controlled according to the value 
of the CNT, and the arithmetic as shown in figure 5 is 

n 

S carried out in the arithmetic block 107, thereby it is 

m 

Ul possible to obtain a result which is the same as that 

M 

N obtained by the original arithmetic with less cycles than 
^ in a case where 9 cycles arithmetic are performed by 
q changing the value i from -4 to 4 in order for m=7 . 
5! Further, the value of the equation (1) when the value 

p of i is changed from -4 to 4 in the arithmetic block 108 
corresponding to m= 8 becomes as follows. 
i = -4 

g(3)+-coef (0) Xg(4) 
= 1XAL 

i = -3 

g(3) + = coef (1) Xg (5) 
= 1 X AU 

i = -2 

g ( 3 ) + = coef ( 2 ) X g ( 6) 



= 2 X AL 

i = -l 

g (3) +=coef (3) X g £7) 
= 2 X AU 

i = 0 

g (3) +=coef (4 ) X g ( 8) 
= 4XAL 

i = l 

g(3)+=coef(5) Xmax_padding 
= 2 X CRU 



m 

aw 

Ul i = 2 

Sj g ( 3 ) + = coef { 6) Xmax_padding 

-2XCH0 



i = 3 



p 

l]f g (3) + = coef (7 ) Xmax padding 

"p =1XCRU 

pi 

i = 4 

g(3)+=coef(8) Xmax^p adding 
= 1 X CRU 

Among them, i — 4 is lxg(4) and it can be represented by 
AL in the cycle of CNT-1 shown in figure 5, when i = -3 and 
i = -2 are collected, it results in 1 * g ( 5 ) + 2 * g ( 6 ) and can 
be represented by AU+AL<<1 in the cycle of CNT = 2 shown 
in figure 5, when i— 1 and i = 0 are collected, it results 
in 2xg(7)+4xg(8) and can be represented by AU«1+AL«2 



in the cycle of CNT*=3 shown in figure 5, when i=l and i = 2 
are collected, it results in 2 x m ax_padding + 2 xmax_paddi ng 
and can be represented by CRU < < 1 +C RU<< 1 in the cycle of 
Cnt=4 shown in figure 5, and when i = 3 and i = 4 are collected, 
it results in 1 * max_paddi ng + 1 *max_paddi ng and can be 
represented by CRU+CRU in the cycle of CNT^S shown in 
figure 5. Here, S in figure 5 represents the output at 
the previous cycle from the register 178 and, further, 
+ 8 when CNT = 0 is a value for rounding and when there is 
no rounding, 0 may be input. 

In this way, the selectors 118, 128 and 138 and the 
shifters 148 and 158 are controlled according to the value 
of the CNT, and the arithmetic as shown in rigure 5 are 
carried out in the arithmetic block 108, thereby it is 
possible to obtain a result which is the same as that 
obtained by the original arithmetic with less cycles than 
in a case where 9 cycles arithmetic are performed by 
changing the value of i from -4 to 4 successively for m=S . 
In addition, it is possible to make the cycle number up 
to the concluding the arithmetic be the same as that of 
the arithmetic processing corresponding to the pixel data 
of m=7 preformed by the arithmetic block 107 which is and, 
further, it is possible to make the cycle at which the 
arithmetic processing is concluded be same as that of the 
arithmetic block 107. 



Further, as shown in figure 6, by the output 
selection circuit 7, the values of n=l and n = 2, which can 
be obtained respectively from the arithmetic block 101 
and 102 when CNT=4, are simultaneously output, the values 
of n=3 and n=4, which can be obtained respectively from 
the arithmetic block 103 and 104 when CNT=5, are 
simultaneously output, the values of n=5 and n=6, which 
can be obtained respectively from the arithmetic block 
105 and 106 when CNT=6, are simultaneously output, and 
% the values of n=7 and n = 8, which can be respectively 
|l obtained from the arithmetic block 107 and 108 when CNT=7, 



SJ are simultaneously output. As a result, the arithmetic 
results of the arithmetic blocks 101-108 are output by 
D a pipeline system for each two arithmetic blocks. 
I1J Thereby, it is possible to obtain the pixel data having 

m 

□ been subjected to the deblocking filtering processing. 

As described above, by performing the arithmetic 
processing as shown in figure 2 and figure 5 in the 
arithmetic blocks 101-108, it is possible to obtain an 
arithmetic result which is the same as the arithmetic 
result obtained when the same arithmetic for the 
respective cases of m=l~8 are performed in the prior art, 
with less cycles than those of the prior art. 

Further, according to the embodiment of the present 
invention, the contents of the processing arithmetic of 



respective arithmetic blocks 101-108 are controlled by 
the control circuit 1 in accordance with the cycles of 
the processing arithmetic, so that the cycles of the 
processing arithmetic performed in each combination, up 
to the conclusion of the respective filtering processing 
among a combination of the first arithmetic block 101 and 
the second arithmetic block 102, a combination of the 
third arithmetic block 103 and the fourth arithmetic 
block 104, a combination of the fifth arithmetic block 
105 and the sixth arithmetic block 106, a combination of 

fy>. 

yi the seventh arithmetic block 107 and the eighth 

"SI 

SI arithmetic block 108, respectively constituted selected 
•fl from the first to the eighth arithmetic blocks, should 
□ be the same and, further so that the filtering processing 
ilJ of respective combinations should be concluded in an 

m 

-D order successively. in addition, as shown in figure 6, 
the output selection circuit 7 is controlled so as to 
select the output from the arithmetic blocks in a unit 
of the combination of the arithmetic blocks and to perform 
a pipeline output. Therefore, the results of the 
filtering processing are successively output from the 
respective arithmetic blocks 101-108, thereby resulting 
in a further reduced cycle number for the filtering 
processing. 

In this way, according to the embodiment of the 



m 



present invention, it is possible to carry out the 

arithmetic with changing the value i from -4 to 4 for each 

pixel m shown in the above-described equation (1) by the 

respective arithmetic blocks 101-108 by quite a smaller 

number of cycles than the conventional cycle number. It 

is also possible to reduce the cycle number to a great 

extent by performing the arithmetic simultaneously by two 

arithmetic blocks and output the arithmetic results by 

two arithmetic blocks by a pipeline output. 

in this embodiment of the present invention, it is 

jl described a case where two pixel data g(x) and g(x + l) are 

^4 input per a cycle. However, according to the present 

■2 invention, if when it is possible to control the contents 

D of the processing arithmetic of respective arithmetic 

•W blocks 101-108 in accordance with the cycles of the 
f» 

F? processing arithmetic, so that the cycles of the 
processing arithmetic, among a combination of the first 
arithmetic block 101 and the second arithmetic block 102, 
a combination of the third arithmetic block 103 and the 
fourth arithmetic block 104, a combination of the fifth 
arithmetic block 105 and the sixth arithmetic block 106, 
a combination of the seventh arithmetic block 107 and the 
eighth arithmetic block 108 up to the conclusion of the 
respective filtering processings performed in each 
combination should be the same and, further that the 



1 



filtering processing of the respective combinations 
should be concluded in an order successively, it is 
possible to incrase the width of a bus for transmitting 
plural pixel data to more than 16 bits and to increase 
the number of pixel data input per a cycle to 3 or more, 
for example, to 4. Even in such case, the same effect 
as that obtained by the embodiment of the present 
invention can be obtained. 



APPLICABILITY IN I NDU S TORY 

As described above, the deblocking filter 
arithmetic apparatus and the deblocking filter 
arithmetic method according to the present invention are 
useful as a filter processing section and processing 
method for decoded pixel data in an apparatus for decoding 
image data which have been subjected to the encoding 
process to reproduce the same, and the deblocking filter 
arithmetic apparatus and the deblocking filter 
arithmetic method are particularly suitable for a case 
where the image data are those which are encoded by MPEG 4 
systems . 



